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, Abstract 

We investigate the asymptotic number of elements of size n in a particular class of closed 
d • lambda-terms (so-called _BC7(p)-terms) which are related to axiom systems of combinatory 

logic. By deriving a differential equation for the generating function of the counting sequence 
we obtain a recurrence relation which can be solved asymptotically. We derive differential 
£C) • equations for the generating functions of the counting sequences of other more general classes 

of terms as well: the class of BCK(p)-terms and that of closed lambda-terms. Using elemen- 
tary arguments we obtain upper and lowerestimates for the number of closed lambda-terms 
of size n. Moreover, a recurrence relation is derived which allows an efficient computation of 
| the counting sequence. BCK(p)-terms are discussed briefly. 

1 Introduction 

Lambda-terms play a prominent role in the theory of computer programming. In order to in- 
vestigate properties of randomly generated lambda-terms we have to know how many terms of 
a given size there are. This paper is devoted to the asymptotic counting of particular classes of 
f^i , lambda-terms. 

Lambda-terms were invented by Church and Kleene in the 30ies (see [21II31II1]) together with 
a set of rules for manipulating them, the so-called lambda-calculus. This is a very powerful formal 
language which can be used to describe computer programs, analyze programming languages or 
investigate decision problems. Moreover, it is the basis of the programming language LISP. 

A lambda-term is a formal expression built of variables and a quantifyer A which in general 
occurs more than once and acts on one of the free variables. It can be described by the context- 
free grammar T ::— a \ (T * T) \ Xa.T where a is a variable. The concatenation of terms 
is called application and adding the prefix Xa to a term is called abstraction. Each abstraction 
binds a variable in the whole term following it and each variable can only be bound by at most 
one abstraction. A term where all the variables are bound is called a closed lambda-term. For 
example, * x) * Xy.y) is a closed lambda-term whereas (Xx.(x * z) * Xy.y) is an open one. 

Our aim is to study the asymptotic number of closed lambda-terms of given size when the size 
is tending to infinity. We define the size of a lambda-term recursively by 

|a:| = l, \Xx.T\ =1 + \T\, |(S*T)| = 1 + |S| + |T|. 

Moreover, note that we will count lambda-terms up to isomorphism: Only the structure of the 
bindings is important; variable names are unimportant. The terms Xy.(Xx.x * Xz.y), Xy.(Xx.x * 
Xx.y), Xx.(Xy.y * Xz.x) are considered to be identical. Observe that the second term is obtained 
from the first one by replacing z by i, which is "by coincidence" the same variable as that in the 
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subterm Xx.x just left to it. But as stated above the important issue is that the last quantifier 
does not bind the variable following it; therefore the name must only be different from y. 

Since the determination of the asymptotic number of lambda-terms seems to be a hard problem 
(cf. the discussion of this issue in p] and for a similar problem in j8l end of Sec. 3]) we confine 
ourselves with the asymptotic analysis of a simpler subclass of lambda-terms and give an outlook 
to the analysis of a larger and more complicated subclass. The classes considered are BCI{p)- 
and i?CA"(p)-terms. The names stem from the correspondence of BCI(1)- and BCK (l)-terms 
to the logical systems BCI and BCK , respectively, which are studied in combinatory logic (see 

unnnins]). 

The plan of the paper is as follows: In the next section we state our notations, definitions and 
some immediate observations. In Section [3] we derive the functional equations for the generating 
functions corresponding to BCI(p)-terms, BCA(p)-terms as well as general closed lambda-terms. 
Then we will derive the asymptotic order of the number of BCI(p)-terms (Section 01 . Section [5] 
is devoted to an upper and a lower estimate for the number A„ of closed lambda-terms of size n. 
This is done using rather elementary arguments, but it is still sufficient to obtain the asymptotic 
main term of logA n . Moreover, we will derive a recurrence relation which allows an efficient 
computation of the numbers A„. In the final section, we briefly discuss BCK (p)-terms. 

The enumeration of £?C/(l)-terms was carried out by Bodini et al. [2] by constructing a 
nice bijection to certain diagrams. It was shown that the number of BC/(l)-terms of size n is 
asymptotically 
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if n = 2 mod 3 and zero otherwise. They obtained also the asymptotic number of BCK (l)-terms 
which differs from ([TJ by a factor e^ 2 ") 2/3 ~s ( - 2ri ) 1/3 . 

Models with a different notion of size (leaves do not count, i.e. they have weight zero) were 
studied in [5j [8]. In [5] upper and lower bounds for the counting sequence were derived and 
questions like typability were discussed. The paper [8] appoaches the counting problem by repre- 
sentations of terms using dc Bruijn indices. They derive a recurrence relations for the number of 
terms with or without constraints on the number of free variables and discuss the issue of random 
generation of terms as well. This allows an efficient computation and experimental analysis of 
term properties like typability of some shape characteristics. 



2 Notation and basic facts 

A lambda-term can be regarded as a so-called enriched tree which is a particular directed acyclic 
graph. In fact, consider a Motzkin tree (i.e., a rooted unary-binary tree) and add directed edges 
connecting a unary node and a leaf such that each leaf is "bound" by a directed edge from exactly 
one of the unary nodes that are its ancestors in the tree. The correspondence is obvious (see 
FigureQ]): leaves correspond to variables, unary nodes to abstractions, binary nodes to applications 




V 

X X 

(Xx.{x * x) * Xy.y) 




Xy.(Xx.x * Xz.y) 



Figure 1: Two enriched trees and the closed lambda-terms corresponding to them. Note that the node 
labels can be omitted, since (Xx.(x * x) * Xy.y) and (Xa.(a * a) * Xb.b) are the same term. 



and the additional directed edges to the binding relations between abstractions and variables. 
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Figure 2: Left: a closed A-term of size 17. Center: a term in BCI(2) of size 14. Right: a term in BCK(1) 
of size 15. 

Clearly, since all leaves are bound, the lambda-term is closed. Of course, open lambda-terms 
can be represented in an analogous manner by a directed acyclic graph where some leaves have 
in-dcgrce zero (that means that they have no ingoing directed edge). 

We will not distinguish between a lambda-term and its enriched tree representation. In addi- 
tion, when speaking of lambda-terms, we will utilize the following abuse of the wording: A unary 
node of a lambda-term is a unary node (i.e. node of out-degree one) of the underlying Motzkin 
tree (i.e. a node becoming unary if all directed edges are removed). These are precisely the nodes 
corresponding to abstractions. Analogously, we call the nodes corresponding to applications bi- 
nary nodes and nodes corresponding to variables leaves of the lambda-term. In a strict sense, 
leaves have always degree one and in-degree one as well (i.e. each leaf x is incident with exactly 
one undirected and exactly one directed edge pointing towards x). 

Moreover, we distinguish between edges, i.e. edges of the underlying Motzkin tree, and pointers, 
i.e. directed edges from a unary node to a leaf. 

Definition 1. • BCI(p) is the set of (non-empty) closed lambda-terms where each unary 
node has exactly p pointers, i.e. binds exactly p occurrences of its variable. 

• BCK{p) is the set of closed lambda-terms where each unary node binds at most p leaves. 

A lambda-term from BCI(p) has three types of nodes: unary nodes (which are actually of arity 
p + 1, as there are p pointers going from this node to leaves), binary nodes, and leaves. The size 
of such a lambda-term is the total number of its nodes. We start with some obvious observations: 

Fact 1. The smallest terms of BCI(p) have one unary node at the root and p leaves. There are p 
pointers from the root to all the leaves. Obviously, if we remove the root and all its pointers, we 
are left with a binary tree. Cleary, their size is 2p. 

The number of such terms is therefore equal to the number of binary trees with p — 1 binary 
nodes and p leaves. This is precisely the Catalan number C p _i = (^Ti ) lv- 

Fact 2. A term of BCI(p) with j unary nodes has pj leaves and pj — 1 binary nodes; its size is 
therefore equal to (2p + l)j — 1. 

3 The generating functions for various classes of closed 
lamb da- terms 

We will enumerate lambda-terms by means of generating functions. Let g n = g^ be the number 
of -BC7(p)-terms of size n and G p (z) be the generating function of this sequence. By Fact 2 we 
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have actually 

i>i 

Analogously, define F p (z) = J2 n >i /« z " an d A(j») = X) ti >i where /„ = /„ is the number of 
BCK{p) -terms of size n and A n the number of closed lambda-terms of size n. 

The next step is the setting up of functional equations for the generating functions. This will 
be done by giving a formal specification of the combinatorial objects and then using the symbolic 
method (see [6]). From [2] we already know that G±(z) satisfies the equation 

d(z) = z 2 + zdiz) 2 + AiGiO?), 

where the differential operator Ai is 2z A D and D denotes the ordinary differential operator. 

Proposition 1. The generating function of BCI(p)-terms satisfies the differential equation 

G p (z) = C p ^z 2 p + zG p (z) 2 + A p G p (z) (2) 



Ap = £ ^L z i+2 P +i D i (3) 
i=i 



where 

with constants oei tP defined by 

e L 1 . ) n ( 2 r)'* < 4 > 

Proof. A £?C/(p)-term can be specified by the formal equation 

T = Su({o}xTxT)U({o}xf), (5) 

where the set S is the set of all smallest BC/(p)-terms (c/. Fact 1) and T a certain set of open 
BC7(p)-terms0 This can be explained as follows: A BC/(p)-term falls into exactly one of three 
categories: It is either 

• a smallest term, 

• or its root is a binary node and the two subterms attached to the root are themselves 
BC/(p)-terms, 

• or its root is a unary node and the subterm attached to the root is a BCI{p)-teiui with 
exactly p free leaves. 

In order to specify all £>C/(p)-terms and avoid ambiguities, we have to take some care in the 
choice of T. Indeed, each _BC/(p)-term will be generated exactly once by the specification ([5]) 
if we generate T by starting with a BCI(p)-teim and then generating p leaves and connecting 
them to the unary root node by a pointer in the following way. To construct a term t 6 {o} x T, 
choose a £>C/-term t and p nodes of t, where multiple choices of a node are allowed. Each node v 
corresponds to an edge, namely the edge leading to v if v is not the root and the edge connecting 
v with the new root (of the term t € {o} x T) otherwise. Thus the choice of the p nodes "hits" 
edges of the term t. Assume that I edges are hit and Sj of them exactly i times. 

If an edge is hit i times, then replace it by a path where at each node of the path a binary 
tree is attached, either to the left or to the right of the path, and the number of leaves of all these 
binary trees altogether is equal to i (see Figure [3] for an illustration of this process) . Thus the 



1 Note the slight abuse of notation in (0: The last cartesian product on the right-hand side is not a cartesian 
product in a strict sense, but only on the level of the underlying Motzkin trees, since we will add pointers going 
from the new root to some leaves of T. 
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replacement creates i new leaves and i new internal nodes. The whole replacement process creates 
exactly Ylf—i is, — p new leaves and p new internal nodes in t. Therefore t has exactly 2p + 1 
more nodes than t and obviously t is a i?C/(p)-term. Conversely, if we have a BCI(p)-term with 
a unary root node, then removing it together with its pointers yields a term with p free leaves. 
These leaves must be children of a binary node since otherwise the parent node must have pointers 
to p descendants which is impossible. Thus the free leaves induce a set of subtrees of t which are 
binary trees with free leaves only. 




Figure 3: To the left, a BCI(4) term with a node pointed once and another pointed 3 times where pointing 
at a node is represented by encircling the dot representing it in the figure. So the root on top is pointed 
at once, the right-most leaf three times. The corresponding hit edges are the thick ones. 

At the right, a possible BCI(4) obtained from the left term. Each thick edge has been replaced by a 
thick path where binary trees have been attached; their leaves are linked to the newly created unary node 
at the root. The root edge (on top of the left term) has been replaced by a path of length two (having 
thus three nodes) and a size one tree has been attached to the left at the middle node; the second thick 
edge of the left term has been replaced by a path of length 3 with two attachments: a size one tree left 
from the second node and a size 3 tree to the right of the third node of the path. 



Now let us count in how many ways this can be done. Each edge which is hit i times is actually 
replaced by a sequence of left or right binary trees. The generating function associated to binary 
trees is T{u) — X) n >i^«-i u " = (1 — v 7 ! — 4u)/2. Thus the number of such sequences with 
exactly i leaves is 

r «i 1 r ii 1 f 2i \ 

\u\ r^r = " , — = 

1 J 1 - 2T{u) 1 J VT^I V * / 

Note that s; of the I edges are hit i times, i — 1, . . . ,p. The number of ways to partition the I 
edges w.r.t. the multiplicity of the hits is ( Si s ) • Then each of the s; edges which is hit i times 

is replaced by one of the possible sequences of binary trees. Therefore there are Ili=i (?) 
ways of doing the whole replacement. Finally, note that choosing I distinct edges corresponds to 
applying the operator z l D l /l\ on the level of generating functions and the 2p+l new nodes created 
during the replacement process yield a factor z 2p+1 . □ 

Proposition 2. Let F(z) denote a formal power series (with real coefficients) , D u — d/du, the 
formal derivative, and U the operator G{u) <— > G(0), G(u) being a formal power series. Then 

A r F W = i^UD lF (-^g) = W (^) ■ 
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Proof. The second equation is obvious since UD^/pl = [u' p ] is exactly Taylor's theorem. For 
proving the first equation set D z — d/dz and f(u) := — 4u = J2 i>0 (^w 1 . Therefore by Faa 

di Bruno's formula (see e.g. [U p. 137]) we obtain 

-tf£>£F(*/W) = — - £ (D Sl+ '" +Sl1 F) (z/(0)) [] -C7£>-(,/( u )) 



Pi 



m— 1 
P 



E?=i is i=p 



/2m 

^ m— 1 v v 



' J E , i.!. . (D Sl+ ~ +s *F)(zf(0)) [] I : 



,2P- 



p 



P 1 / I \ A /2m 



E = A p F(z). 



L 
i=i 

where we substituted si + • • ■ + s p — I in the second line and split the sum according to the value 
of I and used /(0) = 1 in the third line. □ 

Remark 1. More heuristically, we could argue in the following way: Regard F (z) as a generating 
function of a tree-like structure where z marks the number of nodes. Then F(z/y/l — 4u ) is the 
generating function where the nodes are substituted by a node and a sequence of "left-or-right" 
binary trees where the number of leaves is marked by u. Thus [u p ]F(z/\/l — 4u) is the generating 
function of those objects where the binary trees introduced by the substitution altogether contain 
exactly p leaves. The term z 2p+1 accounts for introducing the 2p+l new nodes. This comes from 
counting the nodes of the binary trees coming from the substitution, adding an extra root for each 
of these trees and adding a new root to the total structure. This is precisely what A p does. 

The derivation of the differential equation of the generating function for BCK (p)-terms is a 
little more involved. Note that the differential operator A p corresponds to p pointers from the 
root to some leaves. One is tempted to replace A p in ([2]) by a sum of A;'s to take into account 
less than p pointers. But this is not entirely correct. 

Proposition3. Let F p {z) be the generating junction associated to BCK(p) -terms. Then F p {z) = 
Y[zj{\ — z)) where Y(z) is the unique power series Y{z) — ^2 n > Y n z n with nonnegative coeffi- 
cients which satisfies 

Y(z) = £ C^z 21 + zY(zf + (E A, J Y(z). (6) 
i=i \i=i ) 

Proof. The (in some sense) minimal BCK (p)-terms are binary trees with at most p leaves and a 
unary root node pointing at all the leaves. This gives the first term on the right-hand side of ([6]). 

Note that a unary node may also have zero pointers. A unary node with zero pointers which is 
not on top of the tree cannot be generated directly by a specification similar to ([5]). Therefore we 
first construct terms where each unary node has at least one pointer. Similar arguments as in the 
BCI case then lead directly to ©. Finally, replace the edges by paths which exactly corresponds 
to the substitution z — > z/(l — z). □ 

An alternative approach is to start with Motzkin trees with an additional root having pointers 
to all leaves as minimal structures. The terms with a unary root node can then be generated 
in the following way: Fix the number I of pointers we want to have at the root and then do an 
edge hitting process as in the BCI case. But instead of substituting the hit edges by sequences 
of left-or-right binary trees, use sequences of left-or-right Motzkin trees with an additional unary 
root node (corresponding to the nodes in the paths which substitute the hit edges) such that these 
trees have altogether / leaves. Recalling that on the level of generating functions edge hitting 
corresponds to applying a differential operator, we get in that way a differential equation for 
F p (z). 
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Proposition 4. Let M(z, u) denote the generating function of Motzkin trees where z marks the 
size ( i. e. the total number of nodes ) and u marks the number of leaves. This function is given by 
the unique power series solution of M(z,u) = uz + zM(z,u) + zM(z,u) 2 , that is 



Ml \ 1- z- ^{l- z) 2 -Auz 2 

M(z,u) = — . (7) 

Then F p (z) is given as the solution of 

Proof. This is a direct consequence of the remarks above and Proposition^ □ 

Let A„ denote the number of closed lambda-terms and A(z) = Xm>i ^« z "- Then we can use 
the two approaches presented above to find functional equations for A(z). 



Proposition 5. LetC(z) = (1 — yl — 4z 2 )/2 be the generating function associated to binary trees 
with an extra unary root node and counted by the number of nodes. Furthermore, let A(z) be the 
power series solution of 



A(z) = C(z) + zA(z) 2 + zA [ 1 _ 2C{z) ) - *A(*). (9) 
Then A(z) = A(z/(1 — z)). Moreover, we have 

A(z) = zM(z, 1) + zA(zf + zA ( 1 _ 2 ^ (z>1) ) • (10) 

Proof. To prove ([9]) we can proceed as in the proofs of Propositions [TJ and [3] but allowing an 
unbounded number of edge hits instead. Thus, if A(z) is the generating function associated to 
closed lambda-terms where each unary node carries at least one pointer, then 

AO) = C P -iz 2p + zA(z) 2 + VA(z) 
P >i 

where T> = J2 P >i ^p- Now applying Proposition[5]yields ©■ As in the BCK case, in order to create 
unary nodes carrying no pointers we replace the edges by paths which yields A(z) = A(z/(1 — z)) 
and completes the proof of ©. 

Alternatively, the lambda-terms with a unary root node can be created by starting with 
Motzkin trees with a unary node on top pointing to all leaves. These initial configurations are 
then expanded iteratively by substituting the edges by paths and attaching nodes, either left or 
right, which are (unary) roots of Motzkin trees, each binding all the leaves of its subtree. For an 
illustration of the expansion process, Figure 2] shows one step in this expansion process (not the 
initial one). Figure [5] presents one step of the reverse process. □ 



4 The asymptotic number of BCI(p)-terms 

Recall that G p (z) = Yln>i 5n(2p+i)-i z " ( ' 2p+1 ' )_1 is the generating function of the counting sequence 
of BCI(p) terms. The function G p (z) satisfies the functional equation ([2} which involves the 
differential operator A p given by ([3]) . Our goal is now to get a recurrence relation for the coefficients 
ofG p (z). 
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Figure 4: A step of the grafting expansion of a lambda-term 



Proposition 6. The coefficients g n (2p+i)-i satisfy the recurrence relation 

n-l 



9n(2p+l)-l = E 5((2p+l)-l5(n-l-0(2p+l)-l + Qp( n - l)fl , (n-l)(2p+l)-lj f or n > 2 , (11) 
1=1 

with initial condition 5(2 P +i)-i = Cp-i and where 

^ /n(2p + l)-l 



(12) 



Proof. Obvious, since the first term on the right-hand side of <j2j) only affects the case n = 1, the 
quadratic term is a Cauchy product and A p is a linear combination of powers of the ordinary 
differential operator which acts on the coefficients of the power series exactly as shifting and 
multiplication by Q p (n — 1) do. □ 

Lemma 1. The polynomials Q p (n) can be represented more explicitly as 



Q P (n) = 4? 



Proof. Set f(u) = 1/ yl — 4u. It is easy to see that a m .p = [u p ] (f(u) — l) m and that the coefficient 
on the right-hand side is zero if m > p. Thus we obtain 

Qp( n ) = }^ I m ) am -v 



m— 1 

[u 

4 P 



\ m 

(p+\)n + p 
P 



and we are done. 



□ 
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Figure 5: Finding the original closed lambda-term: first, the unary root node and all the leaves bound 
by the root are coloured white. Then delete the white nodes and colour all their neighbours white. Now 
continue recursively, where deletion of white unary nodes is done by removal and gluing the incident edges 
together. 



The key to the asymptotic analysis is a linearization of the differential equation which is possible 
due to the fast growth of the coefficients of G p (z). We start with an auxiliary result for fast growing 
sequences saying that in the Cauchy product only the extremal terms are asymptotically relevant: 

Lemma 2. Let no e N and A(z) = J2 n >n a « z " be a power series with positive coefficients 
(from index no on). Assume that there exists a > 1 with a n+ i/a n = il^) as n — »■ oo. Then 



*]A{i 



2a„ a„_„ (f + 0(n l7 )) as n — > oo. If we want the second order term, we take the 



next two terms, and so on. 

Proof. Define q(n) = a n+ i/a n ; then l/q(n) = 0{n~ a ). W.l.o.g. assume that n is odd. Then the 
coefficient of z n in A(z) 2 is 



J2 aiC 

I— n 



[n/2] -no 

2d no ci n — no 2 ^ ^ CL no -\-jCi n — no —j 



2,d nQ (X n - 



i + 



i=i 

Ln/2j-n 



q(n )q(no + !)■■■ q(n Q + , 



q(n — no — l)q(n — no — 2)...q(n — no — I) 



i=i 

In the case where n is even we have to subtract l{ n /2eN} a ^/ 2 011 the r.-h. side. 

The first term of the sum in the last line is q(n )/q(n — no — 1) = 0((n — n a — 1 )~°") = 0{n~ a ) 
(recall that n is a constant). The further terms are of order 0(n~ 2a ) and there are not more 
than \n/2\ of them. Thus the sum is of order 0(n 1_2<T ) = 0(n _<T ). Hence 



2a„ a„_„ (f + 0(n- ff )) - [z n }2a no z n ° A{z) . 



\z n ]A 2 {z) 

We are now ready to derive bounds for the coefficients of G p (z). 
Lemma 3. Define 4> n = g n (2 P +i)-i> ( n > !)• Then we have <fin+i/<f>n = Q(n p ) as n — >• oo. 



□ 
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Proof. By (fTTj) we have <j>i = C p _i and, for n > 2, 

, (fri&n-i-i + Qp(n — l)(j) n -i. (13) 



71-1 



Z = l 

Thus 0„ > Q p (n— 1) 4> n -i- By Lemma[T]it is obvious that Q p (n) is a polynomial in n with leading 
term ?-J2E+D— n p which implies the result. □ 

Corollary 1. For p > 1, the sum Ylm+l=n-i ^m^l * s asymptotically equal to 2</>i</>„_i(l + 
0(1/^)). 

Remark 2. The intuition behind the considerations above is as follows. From our study of BCI(1) 
and from bounds already obtained (although for a different model) [5], we already know that 
the asymptotic behaviour of the number of lambda-terms widely differs from that of the number 
of trees: the signifiant increase in the number of lambda-terms of given size when compared to 
Motzkin trees, i.e. the trees forming the underlying structure of lambda-terms, comes from the 
large numbers of ways of binding leaves to unary nodes; indeed we are dealing here with directed 
acyclic graphs. Hence the role of the term G^, which corresponds to the "purely binary tree- 
like" structure, is asymptotically negligible when compared to that of the differential term which 
captures the binding of leaves. 

Remark 3. The exact differential equation for G p {z) is ([2|) whereas the arguments in Remark [2] 
show that we may work with the linearizecH equation 

L p (z) = C p - lZ 2p + A p L p (z). (14) 

The linearized equation has a combinatorial interpretation as well; indeed, it counts the number 
of structures S defined as follows: The smallest possible structures of S are precisely the smallest 
-BC/(p)-terms, i.e., a unary root followed by a binary tree with 2p — 1 nodes (and pointing to 
all leaves of this binary tree). All terms in S have a unary node as their root. To contruct 
larger terms, we add a new root and expand the subterm below using the same edge hitting and 
expansion process as for BC I (p) -terms. Thus these terms may have binary nodes, but never as 
root. 

Lemma 4. Forp> 1, the sequence (<p n )n>i satisfies 

n-1 

2</>l</>n-l < 2J Mn-l-l < 2010„_1 + (n - 3)02</>,j-2- 

1=1 

Proof. The lower bound is obvious: we just keep the first and the last term. Set q(n) = (f> n +i/ ' <t>n- 
To prove the upper bound, note that (4> n )n>i is monotonically increasing and that for any 1 < 
i < l~( n — 3)/2] we have 

A> A. ^ g(2)g(3)---g(l + i) 

(p2 +i (pn-2-i = <P2<Pn-2—, ^— ; -. : :r > <p2<Pn-2- □ 

q[n — 2)q(n — 3) • ■ ■ q{n — 1 — 1) 



Next we turn to the linearized equation (JX4J) . 
Theorem 1. Set l p n — [z n ]L p (z) where L p is given by (fl4"|) . Then, for fixed p and n — > oo, 

V~S p ^-V"(n-l)P 



2 This is not a linearization in a strict sense; we did not replace the quadratic term by a linear one, but only 
omitted it. 
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p 


a p 


Ap 


2 


1.048668... 


0.981017... 


3 


1.0046726194... 


2.19232485... 


4 


1.0006911656... 


6.17349476... 


5 


1.0001221936... 


19.2515312.. . 



Table 1: The first few values of a p . 



where 



B P = C^X{ } (15) 

=i i yx -t- 2p+1 j 

= C p _ 1 exp('-^l^ log(r(a:)) darVl + O , asp^oo, (16) 

w Cp_i (1.0844375142 )( 2 p+ 1 )/ 2 N + O 



and 



Proof. Equation (|14|) implies £ p .2p = Cp-i and = Q p (n — l)£ p . n -2 P -i for n > 2p. Thus 



n-1 



^p,(2p+l)ri-l — Cp_i JJ^ Qp(j) 



3 = 1 



<M^) n 



fir(i + ») 

= <ws-i(„- ^ n n (i + ^rr 1 ■ 7) 

Finally, note that, as n — >• 00, 



which completes the proof. The asymptotic form (|16p can be obtained by Euler-McLaurin's 
formula. □ 

Theorem 2. For p > 2, i/ie number of BCI(p)-terms of size (2p + l)n — 1 is asymptotically 

Ap^n^in-iy.P 

where /3 P and j p are as in (fl~7|) and A p — a p B p with B p as in (fl~5|) and a p = 1 + 0(l/(pe p )) ; as 
p — > OO. 

Remark 4. The first few values of the constants a p and A p appear in Table [TJ 
Remark 5. Applying Stirling's formula we get the alternative form 

App;- 1 n^n np 

where 

a Pv_ - _ ~ 5 P 

Pp — „ 1 7p 



eP 4p + 2 

and Ap = {2Tt/e 2 y/ 2 A p . 
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Proof. From the recurrence relation for (f> n , Eq. (|13p . we have 



n-l 



K = <f>7i-lQp(n - 1) + 2J Mn-1- 



1=1 

= 4> n -i {Qp{n - 1) + r n _i), 

with r„_i = J2i<i<n-i Mn-1-1 / 4>n-i and Q p {n) defined in ((H]). Thus 

n— 1 n— 1 

^ = 0i n WpCj) + = n qpCj) 



where K p {n) — nj=i (l + q (j) J' ^ or P - ^ we have Q(ri) — il(n p ) and furthermore Lemma [4] 
gives r„_i = 2<j)i+0(l/n p ~ 1 ) = 2C p _i+0(l/n p ~ 1 ). Hence the sequence (K p (n)) n >i is convergent 
and we get 



' n-l 



\ j=1 

where a p = C p -\ ■ limn^oo Kp{n). The product C p _i Y¥j=i QpU) i s already evaluated in (|18[) . 
yielding the asymptotic behaviour of the solution of the linearized equation given in Theorem [T] 
The difference between the linearization and the 4> n is hidden in the constant a p . Thus we 
are left with the determination of a p . We will confine ourselves with an asymptotic evaluation for 
p — > 00. 

First note that Lemma [T] immediately implies the inequality 



Qp{n)> 2P{2p + l)P nV. (10) 



Now observe that Li = and that by Lemma |4] we have Tj < 2<\>\ + (j — 2)<p2<pj-i/<pj- The 
quotient in the last term was already estimated in the proof of Lemma [3] by <fij-i/<ftj < 1/Q p (j). 
Using this estimate as well as the inequality (fTUt we obtain (for j > 1) 



1 i < 201 + 1 ; r = ZO,,_1 + 7- 



2P(2p+l)PjP p 2P(2p+l)PjP' 



Hence we get 



^=n( i 

i>2 



CP 



(i) 



<TT(l+ Cp ~ ip] + 02P! 

11 I 9Pf9r>-l- LlP-iP 



3>2 



2P(2p + 2 2 P(2p + l)2pj2p-i 



" n ( X + 2P(2p + 1)^ ) Q ( X + 2 2P(2 P t [ypfp-i ) • (20) 



The two products above turn out to be of the form Y[j ( 1 + J with e — > as p — > 00. Thus we 
can easily estimate them by 

(1 + 1) = EE ^ A - E ^#&*). 
j v j 7 j fe>i j *; 

Since ((x) = 1 + 0(2 -JB ) as 1 ^ 00 we obtain JJj U + ff) = 1 + °( £ p)- Now turning to (HOI) we 
have 

C p-W- !_ and f 2p) 

2P(2p+l)PjP peP y2i 2 2 P(2p+l) 2 Pj 2 P-i 1 ' 
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where we used </> 2 = <friQ p (l) = C p _i4 p ( 2p p 1 ) for the second estimate. This implies a p = 1 + 
0(1/ (pe p )) which completes the proof. □ 

5 Closed lambda-terms 

So far, we are unable to determine the asymptotic behaviour of A„. We will derive upper and 
lower estimates and a recurrence relation which allows an efficient computation of A„ . 

5.1 Estimates for A ra 

The number of BCI(p)-terms is certainly a lower bound, but using rather crude and elementary 
estimates a better bound can be obtained. 

Theorem 3. The number X n of closed lambda-terms of size n satisfies for every e > and for 
sufficiently large n the inequalities 

. An \ n/2 ^^n~ , /9(l + e)n\™ /2 (logn)™/( 21o s™) 

c\ — < A„ < c 2 ' 



s e log n) n ~ ~ \ e log n J n 3 / 2 

where ci,c 2 are some positive constants. 

Proof. We determine the lower bound by counting particular lambda-terms of size n. Take a 
binary tree with n/ leaves and attach to its root a string of n u unary nodes. Then connect the 
leaves to the unary nodes by pointers. Each such object is a closed lambda-term and there are 
C„ / n" / such terms. Note that n u = n + 1 — 2n/. Hence we obtain 

n-l 

(n+l-h u )/2 



where h u and n / are those values of n u and n / , respectively, where n™ / attains its maximum. The 
maximum is attained at n u = n/W(en) where W(n) is Lambert's VF-function defined implicitly 
by W(n)e w ^ = n. It is easy to show that 



W(en) — log n — log log n + 1 + O 



log log n 
logn 



This implies 
Hence we obtain 



<™u< 7— r-T— ■ (21) 



logn 



log n log n — log log n ' 



(n/2).(l-l/(log n-log log n))+l/2 



^/2 



■ cxp ( — — — — (log n — log log n) 



log nj y logn \ 2(logn — log logn) 

The lower estimate now follows from C r <~ k\A r /r 3 / 2 (r — > 00) where k\ is some positive constant. 

For the upper estimate we construct a set of objects such that a proper subset corresponds to 
the set of all lambda-terms of size n. Take a Motzkin tree and add pointers such that each leaf is 
connected to an arbitrary unary node. Clearly, each lambda-term is generated in that way. But 
since leaf x might be bound to a unary node which is not on the path from x to the root, we 
generate also enriched trees which do not represent a lambda-term. Therefore we get the upper 



13 



bound A„ < M n ma,xn u * where M n is the number of Motzkin trees with n vertices. As above we 
have n u — n/W(en). Now (f^T|) implies that for sufficiently large n we have 



log n — log log n / 
(1 + e)n\ % / n \~^^ 



log n J \ log n 

(l + e)n\ 2 /nloglogn 
exp 



< 



e log n / \ 2 log n 

where we used log nj (1 + e) < log n — log log n for sufficiently large n. Finally, the well known fact 
M r ~ /c23 r /r 3 / 2 (as r — ► oo and with some constant &2 > 0) completes the proof. □ 

Remark 6. If A„ is the number of closed lambda-terms where the sum of the number of unary 
nodes and the number of binary nodes equals n (so leaves do not contribute to the size), then 
David et al. [5] showed the following result for the growth rate of the counting sequence: 

(4-e)n\"-" /log " - f (12 + e )r^"- ,l/31os " 

S A n < 



log n / \ logn 

Thus the exponential growth is similar to that of A„, although the underlying model is rather 
different. 



5.2 A recurrence relation 

Eq. dTU|) immediately implies that A„ satisfies the recurrence relation 



a„ = m„_i + 2J ^tK + X! <W^ ( 22 ) 



where M n = [z n ]M(z, 1) is the number of Motzkin trees of size n and 



with £ a>r :— [z s ](2zM(z)) r . Note that ( s , r = unless s > 2r and thus 

l(n-e-i)/2] 

(' 

l-l 



r=0 

By Lagrange inversion we obtain 



n— t— l,r- 



r \ s — r 



Ur = r[z s - r ]M( z y = 2 r — V , 

s — r , / — ' \a, o, c 

a,b,c: b+2c=s-2r v ' ' • 

which gives after a few computations 

5nl - h ^*>(*-r)I (n-/-l-2t)I (23) 

f r2 T ( i ~ 1+r )(n—l~2~r)\ 

Now, set := X)r=o ti (t-r)! (n-i-1-2 1)! ' ' This sum is amenable to creative telescoping 

(see [15) ) which yields a system of two recurrences of order one for the multi- index sequence 

{b n ,e,t)n,e,t>o- 

{-I 2 - 2nt -2Zt-l-n + n 2 ) b n . Lt + (2£t -2n£ + 2£ 2 + At) b n j +ht 
+ (-At 2 -2t + Ant -A£t-n 2 + 2n£-£ + n- I 2 ) b n+i tt = 



14 



and 



(2n -t-2)(n-£-2t-2)(n-£-2t-l) bt, n>t - t (t + 1) (n - £ - t - 2) b^ n>t+ i 
-(n-£-2t- 2)(n - £ - 2t - 1) (n - £ - 2i) = 0. 

with the initial conditions given by the sum representation of b n ^.t- This system can be solved 
explicitly and we get 

_ 2£ T(n-£-2) 2 F 1 (-t+l,£ + l;-n + £ + 3;2) 
nU ~ T T{t) 2 T{n-£-2t) 

Now, using the same techniques, we can also obtain a system of two D-fmite recurrences for 5 Ui f. 

(n - £) (n + 1 - £) (n - 21 - 2) 5 n+2 j -(n-£) (2n 2 - Qn£ - 5n + 2l 2 + 3£ + l) <5„ +M 
- (n - 1) (3n 2 - 2n£ + n - £ 2 - U - 8) <5 M + 20 (n - 1) £ {£ + 1) <5„, £+2 
+ 2 (n - 1) (5n - 9^ - 12) £5 n , e+1 = 

and 

(n - £) {£ - n - 1) 6 n+2 j + (n-£) (2n - £) 6 n+hi -£(n-l) 6 n+1 j +1 
-U{n- 1) S n j +1 (n - 1) (3n -2^+1) <5„, f = 0. 

Unfortunately, this system does not admit an explicit solution in terms of classical special func- 
tions. Nevertheless, it allows us to calculate efficiently the values 5 n .e. 

6 Conclusion and outlook 

The motivation for our analysis was the enumeration of closed lambda-terms. Since the problem 
seems hard, we treated the subclass of i?C/(p)-terms which imposes quite a restriction on the 
degrees of freedom in binding variables by quantifiers. Thus we expected the set of £?C/(p)-terms 
to be small in comparison to the set of closed lambda-terms. Our results verify and quantify 
this. Moreover, they show that the restriction is weaker than bounding the unary height, i.e., the 
maximal number of unary nodes on a root-to-leaf path. Indeed, if the unary height is bounded 
by L, then the asymptotic number of terms of size n is in general C?i~ 3 / 2 p n ; i.e. the asymptotic 
behaviour is like that of the number of Motzkin trees. Only for particular values of L, the 
asymptotic behaviour becomes Cn~ 5 / 4 p™ (see [1]). This behaviour changes if the condition on 
the unary height is replaced by a condition on the number of pointers per unary node (as in 
the BCI(p) case) or dropped completely (closed lambda-terms). So these structures are indeed 
different from tree-like structures; their counting sequences grow much faster than that of Motzkin 
trees. So we can conclude that the enumeration of BCI (p)-terms is not only of interest in its own 
right, but also more closely related to the original counting problem than to tree enumeration. 

Since the union of all the sets of BCK (p)-terms, p = 1, 2, . . . , is precisely the set of closed 
lambda-terms, one might be tempted to approach the problem of determining the asymptotic 
behaviour of A„ via the number of BCK (p)-terms and letting p — > oo. For performing such a limit 
we needed precise and uniform asymptotics for the number of BCK (p)-terms. Unfortunately, the 
asymptotic computation of the number of BCK (p)-terms turns out to be much more involved 
than that of the number of BCI(p)-terms. A precise analysis of the BCK case is beyond the 
scope of this paper and will be the topic of a forthcoming paper. Here we discuss only briefly how 
to attack this problem. 

The differential equation © implies a recurrence relation for the coefficients of F p (z). This can 
be linearized in a similar fashion as we did in the BCI case (essentially Lemmas OH]) • The next 
step will be showing upper and lower estimates for f n :— [z n ]F p (z). This enables us to identify the 
asymptotically dominant term in the recursion which yields a rough information on the growth of 

fn- 



15 



The task is now to find the asymptotic behaviour of the correct solution. The growth rate of 
the coefficients tells us that the Borel transform F p (z) of the generating function F p (z) must grow 
exponentially in z. This indicates that F p (z) is Hayman-admissible (cf. [9]) and therefore a saddle 
point analysis applies and eventually yields the asymptotic number of BCK (p)-terms. 

When studying not only the size but further properties of BCK (p)-terms by means of multi- 
variate generating functions, the above remarks suggest that these functions will be (multivariate) 
Hayman-admissible such that a multivariate saddle point method applies (cf. [7]). 

As in the case of closed lambda-terms, the functional equation ((8]) corresponds to a recurrence 
relation of the form (f22|) . The only difference is that <5 Tlj ; in (|23p has to be replaced by 



*».i= E E 



t\ (t-r)\ (n-l-l-2t)\ 

t=0 r=0 \ J \ J 



Similarly as before, this gives rise to a system of D-finite recursions. 

Acknowledgement. The authors thank Marek Zaionc for triggering the authors' interest in the 
subject and for numerous fruitful discussions about it. 
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